Multi-gabor Dictionaries for Audio Time-frequency Analysis

نویسندگان

  • Patrick J. Wolfe
  • Simon J. Godsill
  • Monika Dörfler
چکیده

In this paper we consider the construction of multiresolution Gabor dictionaries appropriate for audio signal analysis. Motivated by a desire for parsimony and efficiency, we propose and formalise the idea of reduced multi-Gabor systems, showing that they constitute a frame for L2(R) and other Hilbert spaces of interest. In order to demonstrate the practicality of such a scheme, we apply it to the atomic decomposition of music and speech signals observed in noise. Qualitative results indicate the potential of this method to yield a salient representation of typical audio signals while at the same time reducing computational costs as compared to a full multiresolution decomposition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparsity and persistence in time-frequency sound representations

It is a well known fact that the time-frequency domain is very well adapted for representing audio signals. The main two features of time-frequency representations of many classes of audio signals are sparsity (signals are generally well approximated using a small number of coefficients) and persistence (significant coefficients are not isolated, and tend to form clusters). This contribution pr...

متن کامل

Time-scaling of Audio Signals with Muti-scale Gabor Analysis

The phase vocoder is a standard frequency domain time-scaling technique suitable for polyphonic audio, but it generates annoying artifacts called phasiness, or loss of presence, and transient smearing, especially for high values of the time-scale parameter. In this paper, a new time-scaling algorithm for polyphonic audio signals is described. It uses a multi-scale Gabor analysis for lowfrequenc...

متن کامل

Multi-View Face Detection in Open Environments using Gabor Features and Neural Networks

Multi-view face detection in open environments is a challenging task, due to the wide variations in illumination, face appearances and occlusion. In this paper, a robust method for multi-view face detection in open environments, using a combination of Gabor features and neural networks, is presented. Firstly, the effect of changing the Gabor filter parameters (orientation, frequency, standard d...

متن کامل

A Gabor Regression Scheme for Audio Signal Analysis

Here we describe novel Bayesian models for time-frequency analysis of non-stationary audio waveforms. These models are based on the idea of a Gabor regression, in which a time series is represented as a superposition of time-frequency shifted versions of a simple window function. Prior distributions over the corresponding time-frequency coefficients are constructed in a manner which favours bot...

متن کامل

Numerical Performance of Time-Frequency Transforms in Lossy Audio Coding

Time-frequency analysis and the transforms that it gives rise to play an important role in digital signal processing. In lossy audio compression, for instance, it has been found that working in the transform domain leads to methods that achieve a much higher level of signal compression than could be achieved in the temporal domain. This article gives an introduction to time-frequency analysis t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001